When estimating models with pooled cross-section and time-series
data (e.g., estimating demand equations for all 50 states), one has
to decide whether or not to pool the data. The usual procedure is to
first test for the overall homogeneity (equality) of the
coefficients. If this hypothesis is not rejected, a single equation
is estimated with the pooled data. If the hypothesis is rejected,
further hypothesis testing may be necessary. For example, if the
model contains more than one coefficient, the equality constraint may
be rejected for only a subset of the coefficients. In this case the
data are pooled and dummy variables are used with the subset of
coefficients for which the equality constraint does not hold. There
are at least three problems with this procedure of pooling (or not
pooling) after some preliminary tests of significance. First, as
noted in Maddala (1977, pp. 332-333), it raises problems about
inference from the pooled model. Second, there is the related
question of what significance level to use when deciding whether or
not to pool. Third, the choice of estimates to select from is quite
limited. That is, one must pick either the pooled or the non-pooled
estimate, even if these two estimates are very different. These
problems suggest that an alternative (or hybrid) method of handling
pooled cross-section and time-series data is needed. The purpose of
this paper is to propose such a method.

In Section I the hybrid procedure is presented. In Section II the
predictive power of the hybrid estimates is compared with the
predictive power of the pooled and non-pooled estimates. Section III
contains the conclusions.


I.  The  Hybrid  Estimation  Technique 
Consider the simple model:

(1)  $Y_{it} = a_i + b_i X_{it} + u_{it}$;  $i = 1, \ldots, N$;  $t = 1, \ldots, T_i$

where $u_{it} \sim N(0, \sigma_i^2)$. The i subscript represents N
cross-sections and the t subscript represents $T_i$ observations for
the ith cross-section. Since equation (1) is to be estimated with
pooled cross-section and time-series data, three general approaches
to the estimation (and prediction) problem are possible.


In the first approach, ordinary least squares is applied to each of
the N cross-sections separately. That is, the OLS estimates
$\hat{a}_i$ and $\hat{b}_i$ are calculated using only the $T_i$
observations for the ith cross-section. However, this approach seems
quite restrictive. The estimates for each individual cross-section
are based only on the information contained in the sample for that
particular cross-section. This implicitly assumes each cross-section
is so different that nothing can be gained by observing analogous
behavior in other cross-sections. A more reasonable assumption might
be that useful information about behavior in any one cross-section is
contained in the behavior of other cross-sections.


In the second approach, the data are pooled. That is, the pooled
estimates $a_p$ and $b_p$ are calculated from a single regression on
the observations for all N cross-sections. This approach has the
advantage of using all the information in the sample to estimate each
cross-section's coefficients. The disadvantage is that the
coefficients for each cross-section are constrained to equality. This
seems quite restrictive in that all cross-sections are assumed to be
exactly alike with respect to the behavior represented in equation
(1).
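The first two approaches can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `ols`, the cross-section labels, and the numbers are hypothetical and do not come from the paper's data.

```python
from statistics import mean

def ols(x, y):
    """Simple-regression OLS for y = a + b*x; returns (intercept, slope)."""
    xbar, ybar = mean(x), mean(y)
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return ybar - slope * xbar, slope

# Hypothetical data: two cross-sections with clearly different slopes.
data = {
    "A": ([1.0, 2.0, 3.0, 4.0], [2.1, 3.9, 6.2, 7.8]),
    "B": ([1.0, 2.0, 3.0, 4.0], [1.0, 1.6, 1.9, 2.5]),
}

# First approach: one regression per cross-section.
separate = {name: ols(x, y) for name, (x, y) in data.items()}

# Second approach: stack all observations and fit a single equation.
pooled_x = [xi for x, _ in data.values() for xi in x]
pooled_y = [yi for _, y in data.values() for yi in y]
pooled = ols(pooled_x, pooled_y)
```

In this toy case the pooled slope falls between the two separate slopes, which is the tension the rest of the section addresses.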

The usual procedure is to choose between the first and second
approach on the basis of an F-test.1 However, both of these choices
seem extreme. In addition, it is not clear which approach will
provide better predictions. All this suggests that a method which
combines the advantages of the first two approaches might be a better
alternative. The third approach to the estimation and prediction
problem is thus a hybrid of the first two approaches.2 In this third
approach, a time-series equation is first estimated separately for
each cross-section, and then these coefficients are adjusted toward
the common coefficient from the pooled regression. The purpose of
this approach is to use all the available information in the
estimation of the coefficients for each cross-section, but at the
same time to allow for differences among individual cross-sections.
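For reference, the F-test used to choose between the first two approaches compares the residual sum of squares from the pooled regression with the combined residual sum of squares from the separate regressions. The sketch below uses the standard form of that statistic; the function name `pooling_f` and the numbers in the example are hypothetical.

```python
def pooling_f(rss_pooled, rss_separate, n_groups, n_obs, n_coef):
    """F-statistic for equality of all coefficients across cross-sections.

    rss_pooled   : residual sum of squares from the single pooled regression
    rss_separate : sum of the residual sums of squares from the N regressions
    n_coef       : coefficients per equation (including any intercept)
    """
    q = (n_groups - 1) * n_coef      # number of equality restrictions
    df = n_obs - n_groups * n_coef   # residual d.f. of the unrestricted model
    f = ((rss_pooled - rss_separate) / q) / (rss_separate / df)
    return f, q, df

# Hypothetical example: 5 cross-sections, 100 observations, intercept + slope.
f_stat, q, df = pooling_f(120.0, 100.0, n_groups=5, n_obs=100, n_coef=2)
```

The statistic is then compared with the critical value of the F distribution with (q, df) degrees of freedom.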


The hybrid method can be briefly described as follows. Assume that
there are two pieces of information about the true parameter $b_i$,
where $b_i$ is a coefficient to be estimated for the ith
cross-section. The first piece of information is $\hat{b}_i$, where
$\hat{b}_i$ is the ordinary least squares estimate of $b_i$ based
only on the observations for the ith cross-section. The second piece
of information is $b_p$, where $b_p$ is the estimated coefficient
when the data are pooled over all cross-sections. According to the
hybrid approach, a reasonable estimate of $b_i$ is a weighted average
of $\hat{b}_i$ and $b_p$. More specifically, the hybrid estimates
$b_i^*$ and $a_i^*$ are defined as:


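$b_i^* = \dfrac{W_1 \hat{b}_i + W_2 b_p}{W_1 + W_2}$,    $a_i^* = \bar{Y}_i - b_i^* \bar{X}_i$

where

$W_1 = \dfrac{S_{x_i x_i}}{\hat{\sigma}_i^2} \cdot \dfrac{S_{y_i y_i}/(T_i - 1)}{\frac{1}{N} \sum_{j=1}^{N} S_{y_j y_j}/(T_j - 1)}$,

$W_2 = \dfrac{1}{Z}$,    $Z = \dfrac{1}{N-1} \sum_{i=1}^{N} (\hat{b}_i - b_p)^2 - \dfrac{1}{N} \sum_{i=1}^{N} \dfrac{\hat{\sigma}_i^2}{S_{x_i x_i}}$,

(15)  $\hat{\sigma}_i^2 = \dfrac{1}{T_i - 2} \sum_{t=1}^{T_i} (Y_{it} - \hat{Y}_{it})^2$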
where the $\hat{Y}_{it}$'s are obtained from the N cross-section
regressions. If only a subset ($K_1$) of the K coefficients are
assumed heterogeneous, then the vectors $\hat{b}_i$ are estimated
from a pooled regression, but dummy variables are used on the
heterogeneous subset. In this case the term
(18)  df = degrees of freedom in the pooled-with-dummies regression.
All other terms, including
one independent variable.


In the single variable model, $W_1$ depends on three factors: (1) the
standard error of the regression of the ith cross-section; (2) the
variance of the dependent variable in the ith cross-section relative
to the average variance for all cross-sections; and (3) the sum of
squared deviations of the independent variable for the ith
cross-section. As the standard error of the regression for the ith
cross-section is smaller, $W_1$ is larger. As the relative variance
of the dependent variable increases, $W_1$ increases. $W_1$ is also
greater, the greater is the sum of squared deviations of the
independent variable. This method of determining $W_1$ is intuitively
appealing. The better the fit for an individual cross-section, as
measured by a smaller standard error of the regression, the closer
$b_i^*$ is to $\hat{b}_i$. However, the standard error of the
regression depends on the variance of the dependent variable. To
correct for the fact that the standard error of the regression may be
small (large) just because the variance of the dependent variable is
small (large), the relative variance of the dependent variable is
also included in $W_1$. Finally, as the variance of the independent
variable increases (or as the number of observations increases), more
information is contained in the estimate of $b_i$. Therefore, as the
sum of squared deviations of the independent variable increases, more
weight is placed on $\hat{b}_i$.5

$W_2$ depends on the variability of the $\hat{b}_i$'s around $b_p$
relative to the average variance of the $\hat{b}_i$'s. As this
relative variance increases, $W_2$ becomes smaller and less weight is
placed on $b_p$. This is also intuitively appealing. As the
variability of the $\hat{b}_i$'s around $b_p$ increases, there is
less reason to think that $b_p$ contains useful information about the
individual $b_i$. On the other hand, as the average variance of the
estimated $\hat{b}_i$'s increases, there is less reason to think that
each $\hat{b}_i$ contains useful information about the individual
$b_i$.

In summary, the hybrid procedure adjusts each $\hat{b}_i$ toward
$b_p$, but puts more weight on $\hat{b}_i$ as $S_{x_i x_i}$ is
larger, $\hat{\sigma}_i^2$ is smaller, and

$\dfrac{S_{y_i y_i}/(T_i - 1)}{\frac{1}{N} \sum_{j=1}^{N} S_{y_j y_j}/(T_j - 1)}$

and $Z$ are larger. When the model contains more than one independent
variable the procedure is similar except for the slight differences
noted earlier.6
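The weighting scheme described in this section can be sketched in Python for the single variable model. The sketch rests on one reading of the weights, which should be treated as an assumption rather than as the authors' exact formulas: $W_1$ is taken as $(S_{x_i x_i}/\hat{\sigma}_i^2)$ times the relative variance of the dependent variable, $W_2$ as $1/Z$, and $Z$ as the dispersion of the $\hat{b}_i$'s around $b_p$ less their average estimated variance. A non-positive $Z$ falls back to the pooled estimate, as in footnote 3. The function names and the data are hypothetical.

```python
from statistics import mean

def fit(x, y):
    """OLS for y = a + b*x, returning the pieces the weights need."""
    xbar, ybar = mean(x), mean(y)
    sxx = sum((u - xbar) ** 2 for u in x)
    syy = sum((v - ybar) ** 2 for v in y)
    sxy = sum((u - xbar) * (v - ybar) for u, v in zip(x, y))
    b = sxy / sxx
    a = ybar - b * xbar
    rss = sum((v - a - b * u) ** 2 for u, v in zip(x, y))
    return {"b": b, "sxx": sxx, "syy": syy, "s2": rss / (len(x) - 2),
            "xbar": xbar, "ybar": ybar, "T": len(x)}

def hybrid(groups):
    """groups: list of (x, y) pairs, one per cross-section."""
    fits = [fit(x, y) for x, y in groups]
    all_x = [u for x, _ in groups for u in x]
    all_y = [v for _, y in groups for v in y]
    b_p = fit(all_x, all_y)["b"]                      # pooled slope
    n = len(fits)
    avg_var_y = mean(f["syy"] / (f["T"] - 1) for f in fits)
    # ASSUMED form of Z: spread of the b_i around b_p minus the average
    # estimated sampling variance of the b_i.
    z = (sum((f["b"] - b_p) ** 2 for f in fits) / (n - 1)
         - mean(f["s2"] / f["sxx"] for f in fits))
    starred = []
    for f in fits:
        if z <= 0:
            b_star = b_p                              # footnote 3 fallback
        else:
            rel_var = (f["syy"] / (f["T"] - 1)) / avg_var_y
            w1 = (f["sxx"] / f["s2"]) * rel_var       # ASSUMED form of W1
            w2 = 1.0 / z                              # ASSUMED form of W2
            b_star = (w1 * f["b"] + w2 * b_p) / (w1 + w2)
        starred.append((f["ybar"] - b_star * f["xbar"], b_star))
    return b_p, [f["b"] for f in fits], starred

# Hypothetical data for three cross-sections sharing the same x values.
groups = [
    ([1, 2, 3, 4, 5], [2.1, 3.9, 6.2, 7.8, 10.1]),
    ([1, 2, 3, 4, 5], [1.0, 1.6, 1.9, 2.5, 3.1]),
    ([1, 2, 3, 4, 5], [0.8, 2.2, 2.9, 4.1, 4.8]),
]
b_p, b_ols, starred = hybrid(groups)
```

Because both weights are positive whenever $Z > 0$, each adjusted slope lies between the cross-section's own OLS slope and the pooled slope. The sketch assumes no cross-section fits perfectly, since a zero residual variance would make $W_1$ undefined.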


II.  An  Application:  Two  Case  Studies 
In this section the hybrid procedure is applied to two different case
studies. The two cases to be analyzed arise in entirely different
contexts, but both involve the use of pooled cross-section and
time-series data to investigate current policy problems. The first
case is the prediction of inflation in fifteen Latin American
countries over the period 1973-1975, based on coefficients estimated
from the 1954-1972 period. The second case is the prediction of
receipts for 49 state governments for fiscal year 1975 based on
coefficients from the 1958-1974 period. In both cases predictions
based on pooled regressions are compared with predictions based on
individual (country or state) regressions and with predictions based
on the adjusted coefficients as suggested by the hybrid method.

Case 1: Inflation in Latin America


The equation to be estimated in the first case study was initially
formulated by Harberger (1963) in his study of Chilean inflation and
subsequently estimated by Vogel (1974) for sixteen Latin American
countries. The equation is:

(19)  $P'_t = b_0 + b_1 M'_t + b_2 M'_{t-1} + b_3 Y'_t + b_4 (P'_{t-1} - P'_{t-2}) + u_t$

where M is the money supply, Y is real GDP, P is the consumer price
index, primes indicate annual percentage changes, subscripts indicate
current or lagged variables, u is the error term, and $b_0$ through
$b_4$ are the coefficients to be estimated.

This equation has been estimated for fifteen Latin American countries
over the period 1954 through 1972.7 Based on the results from the
individual country regressions and the pooled regression, the F-test
indicates that the data should not be pooled. A similar conclusion
was reached by Vogel (1974). However, it is not clear which approach
will provide better predictions. Moreover, Vogel (1974) emphasizes
that allowing all coefficients to differ among countries (i.e., not
pooling) only improves the $R^2$ by .03 over the $R^2$ obtained in
the pooled regression. For the countries and time period covered in
the present study the improvement in $R^2$ is about .06. These
findings suggest that heterogeneity among the countries is quite
limited, a surprising result in light of the widely differing
inflationary experiences of Latin American countries and the
importance frequently attributed to the structural factors excluded
from the model.

The  preceding  discussion  suggests  that  comparing 
the  predictive  power  of  the  pooled,  non-pooled  and 
hybrid  estimates  would  be  a  useful  exercise.  These 
predictions  are  presented  in  Table  1.  The  predic¬ 
tions  are  based  on  coefficients  estimated  over  the 
period  1954  through  1972,  and  the  rate  of  inflation 
has  been  predicted  for  each  of  the  fifteen  Latin 
American  countries  for  the  period  1973  through 
1975.  For  each  set  of  predictions,  three  types  of 
statistics  are  given:  mean  error,  mean  absolute 
error,  and  mean  square  error.  These  statistics 
are  reported  for  each  country  individually,  for 
the  aggregate  of  all  fifteen  Latin  American  coun¬ 
tries,  and  for  the  aggregate  excluding  Chile. 
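The three summary statistics can be computed as in the following sketch. The sign convention (actual minus predicted, so that a positive mean error indicates underprediction) is an assumption chosen to match the way Table 2 is discussed; the function name and the sample numbers are hypothetical.

```python
def prediction_errors(actual, predicted):
    """Mean error, mean absolute error and mean square error of a forecast."""
    # ASSUMED convention: error = actual - predicted (positive = underprediction).
    errors = [a - p for a, p in zip(actual, predicted)]
    n = len(errors)
    return {
        "mean_error": sum(errors) / n,
        "mean_absolute_error": sum(abs(e) for e in errors) / n,
        "mean_square_error": sum(e * e for e in errors) / n,
    }

# Hypothetical inflation rates for one country over three years.
stats = prediction_errors(actual=[0.10, 0.20, 0.30],
                          predicted=[0.12, 0.15, 0.25])
```

Mean error can be small even when individual errors are large, because over- and underpredictions cancel; the other two statistics do not have this property, and mean square error penalizes large misses most heavily.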

Chile  completely  dominates  the  aggregate  results 
because  of  its  astronomical  rate  of  inflation  which 
averaged  more  than  400  percent  per  year  over  the 
1973-1975  period.  Hence  it  is  useful  to  examine 
the  aggregate  results  excluding  Chile  as  well  as 
the  results  for  each  individual  country.8 

The  coefficients  from  the  pooled  regression  provide 
the  best  predictions  in  the  aggregate  including 
Chile,  but  only  because  the  underprediction  of 
Chilean  inflation  is  slightly  less  serious  with 
this  method  than  with  the  alternative  methods. 

When  Chile  is  excluded,  all  three  methods  give 
much  better  predictions  in  the  aggregate.  In 
particular,  the  coefficients  from  the  pooled  re¬ 
gression  underpredict  the  rate  of  inflation  by 
only  a  little  more  than  1  percent  on  the  average, 
while  the  hybrid  method  underpredicts  by  just 
over  2  percent.  These  results  seem  quite  good 
considering  that  average  rates  of  inflation  ranged 
from  approximately  8  percent  in  Honduras  and  Vene¬ 
zuela  to  more  than  80  percent  in  Argentina  and 
Uruguay  for  the  1973-1975  period.  The  coefficients 
from  the  individual  country  regressions  yield  the 
predictions  with  the  lowest  mean  square  error  in 
the  aggregate  excluding  Chile,  but  the  mean  square 
error  for  the  hybrid  method  is  only  slightly  high¬ 
er.  Overall,  the  hybrid  method  appears  to  give 
the  best  predictions  in  the  aggregate  excluding 
Chile.  Not  only  is  the  hybrid  method  a  very  close 


second  in  terms  of  the  mean  error  and  mean  square 
error  criteria,  but  it  also  produces  the  lowest 
mean  absolute  error. 

Turning  to  the  results  for  the  individual  coun¬ 
tries,  none  of  the  methods  clearly  predominates  in 
predicting  inflation  in  the  five  Latin  American 
countries  which  historically  have  experienced  high 
rates  of  inflation.  As  indicated  above,  the  co¬ 
efficients  from  the  pooled  regression  yield  the 
best  predictions  for  Chile,  and  the  same  is  true 
for  Brazil.  On  the  other  hand,  the  coefficients 
from  the  individual  regressions  yield  the  best 
predictions  for  Argentina,  Bolivia,  and  Uruguay. 
However,  for  Bolivia  the  hybrid  method  gives 
equally  good  predictions,  and  for  Uruguay  only  one 
observation  is  available  for  comparison. 

For  the  ten  Latin  American  countries  with  moderate 
or  low  rates  of  inflation,  the  results  are  much 
more  clearcut.  For  all  of  these  countries  except 
Venezuela  the  coefficients  from  the  pooled  regres¬ 
sion  or  the  hybrid  method  give  the  best  predic¬ 
tions  of  inflation.  Moreover,  the  coefficients 
from  the  individual  regressions  give  the  worst 
predictions  for  eight  of  these  ten  countries. 

It  is  thus  apparent  that  useful  information  about 
the  behavior  of  inflation  in  individual  Latin 
American  countries,  especially  moderate  and  low 
inflation  countries,  can  be  obtained  by  examining 
the  behavior  of  inflation  in  the  other  Latin 
American  countries. 
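The aggregate figures and the country tallies in this discussion can be checked against the mean square errors in Table 1. The sketch below assumes that each aggregate is the mean over country-year observations, with three predicted years per country and one for Uruguay; under that assumption the computed aggregates agree with the reported ones to rounding. The variable and function names are illustrative.

```python
# Mean square errors from Table 1: (individual, pooled, hybrid).
TABLE1 = {
    "Uruguay":     (0.1192, 0.1491, 0.1279),
    "Bolivia":     (0.0278, 0.0403, 0.0280),
    "Chile":       (3.8945, 3.4946, 6.1571),
    "Brazil":      (0.0182, 0.0060, 0.0179),
    "Argentina":   (0.0691, 0.1917, 0.1184),
    "Paraguay":    (0.0051, 0.0033, 0.0039),
    "Colombia":    (0.0076, 0.0045, 0.0056),
    "Peru":        (0.0048, 0.0044, 0.0007),
    "Mexico":      (0.0062, 0.0023, 0.0029),
    "Ecuador":     (0.0023, 0.0033, 0.0020),
    "Costa Rica":  (0.0298, 0.0037, 0.0112),
    "Honduras":    (0.0052, 0.0031, 0.0023),
    "Venezuela":   (0.0081, 0.0629, 0.0162),
    "El Salvador": (0.0098, 0.0011, 0.0007),
    "Guatemala":   (0.0136, 0.0003, 0.0013),
}

# ASSUMPTION: three predicted years per country, except Uruguay (1973 only).
N_OBS = {country: 3 for country in TABLE1}
N_OBS["Uruguay"] = 1

def aggregate(exclude=()):
    """Observation-weighted mean square error for each of the three methods."""
    keep = [c for c in TABLE1 if c not in exclude]
    total = sum(N_OBS[c] for c in keep)
    return tuple(sum(N_OBS[c] * TABLE1[c][m] for c in keep) / total
                 for m in range(3))

with_chile = aggregate()
without_chile = aggregate(exclude=("Chile",))

# The ten moderate- and low-inflation countries are the last ten rows.
LOW = list(TABLE1)[5:]
worst_individual = [c for c in LOW if max(TABLE1[c]) == TABLE1[c][0]]
best_individual = [c for c in LOW if min(TABLE1[c]) == TABLE1[c][0]]
```

Chile's three observations account for most of the total squared error under every method, which is why the ranking of methods changes once Chile is dropped.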

Case  2:  A  Model  of  State  Government  Receipts 

In  the  second  case  study,  the  total  receipts  of 
forty-nine  state  governments  are  analyzed.  The 
equation  to  be  estimated  is: 

(20)  Total Receipts$_t$ = $b_1$ Potential GNP$_t$ + $b_2$ Real Income$_t$
      + $b_3$ Deflator$_t$ + $b_4$ Surplus$_{t-1}$ + $u_t$

where all variables are measured in percentage changes, except the
surplus which is the level in the preceding year; $u_t$ is the error
term, and $b_1$ through $b_4$ are the coefficients to be estimated.
The coefficient of potential GNP indicates the long-run elasticity of
total receipts with respect to the growth in potential output. The
most important coefficient, $b_2$, indicates the elasticity of total
receipts with respect to short-run fluctuations in real income. These
fluctuations in economic activity are measured by state real personal
income (where state personal income is deflated by the implicit price
deflator for GNP).

The elasticity of total receipts with respect to changes in the price
level is indicated by the coefficient of the deflator, which is
measured by the implicit price deflator for state and local purchases
of goods and services. Finally, the level of the surplus (or deficit)
in the preceding year is measured for each state by total receipts
less total expenditures as a percentage of total expenditures. This
variable is introduced because the greater the level of the surplus
(deficit) in the preceding year, the less (more) will be the need for
tax increases during the ensuing year, and hence the smaller (larger)
will be the percentage increase in total receipts.
The statistical analysis which follows covers only 49 of the 50 state
governments. Alaska has been excluded because of very large and
erratic grants from the Federal Government. Data on the individual
states are from the Bureau of the Census, State Government Finances,
for the fiscal years 1957 through 1975. All states but three have
fiscal years which begin on July 1 and end on June 30, and for the
independent variables quarterly averages conforming to these fiscal
years have been used. Total receipts of each state are measured net
of receipts of state unemployment compensation funds and net of
receipts of state social insurance funds (e.g., public employee
retirement systems). These funds have been excluded because they are
handled separately and cannot be mingled with general state funds.

All  variables  have  been  expressed  in  percentages 
for  a  variety  of  reasons,  but  the  most  important 
reason  is  so  that  the  data  can  readily  be  pooled 
across  states.  In  addition,  when  the  variables 
are  expressed  as  percentage  changes,  the  estimated 
coefficients  are  elasticities  which  are  constant 
for  all  values  of  the  dependent  and  independent 
variables.
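To see why coefficients on percentage changes can be read as elasticities, consider the income term of equation (20) in isolation. Writing $R$ for total receipts and $Y$ for state real personal income (shorthand introduced here for illustration only) and holding the other regressors fixed:

```latex
\frac{\Delta R_t}{R_{t-1}} = b_2 \, \frac{\Delta Y_t}{Y_{t-1}}
\quad\Longrightarrow\quad
b_2 = \frac{\Delta R_t / R_{t-1}}{\Delta Y_t / Y_{t-1}}
```

A 1 percent rise in real income is thus associated with a $b_2$ percent rise in receipts whatever the levels of $R$ and $Y$, which is what makes the coefficients comparable across states of very different size.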

To test for the homogeneity of all coefficients in equation (20), the
F-ratio for the reduction in residual sums of squares is 1.33 (with
degrees of freedom 192, 637). Based on interpolation of the F-tables,
this is significant at the 1 percent level.

To explore further this apparent heterogeneity among states, F-ratios
have been calculated for the individual coefficients. When the
coefficients for state real personal income are allowed to differ
among states, but the remaining coefficients are required to be the
same across states, the F-ratio for reduction in residual sums of
squares is 1.73 (with degrees of freedom 48, 781). This is
significant at the 1 percent level by a comfortable margin. Thus, the
response of receipts to fluctuations in economic activity does differ
significantly among states when economic fluctuations are measured by
state real personal income. When the test for significant reduction
in residual sums of squares is applied to the remaining coefficients,
given that the coefficients of state real personal income differ
among states, the F-ratio is 1.18 (with degrees of freedom 144, 685).
This is not significant at the 5 percent level, so the response of
receipts to potential GNP, the deflator, and the surplus is assumed
to be homogeneous across states.
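The degrees of freedom reported for the first two tests follow from the dimensions of the sample, as the short calculation below shows. It assumes a balanced panel (49 states, each with one observation per fiscal year from 1958 through 1974) and the four coefficients of equation (20) with no intercept; the variable names are illustrative.

```python
N_STATES = 49
T_YEARS = 1974 - 1958 + 1        # fiscal 1958 through fiscal 1974
K = 4                            # coefficients in equation (20), no intercept
n_obs = N_STATES * T_YEARS       # ASSUMES every state has all 17 observations

# Test 1: all four coefficients common vs. all four state-specific.
q_all = (N_STATES - 1) * K               # restrictions
df_all = n_obs - N_STATES * K            # residual d.f., unrestricted model

# Test 2: fully pooled vs. state-specific income coefficient only.
q_income = N_STATES - 1
df_income = n_obs - (K - 1) - N_STATES   # 3 common + 49 income coefficients

# Test 3 (numerator only): the remaining three coefficients, given that the
# income coefficients already differ among states.
q_rest = (N_STATES - 1) * (K - 1)

print((q_all, df_all), (q_income, df_income), q_rest)
```

The numerator counts are simply the number of equality restrictions being tested: 48 per coefficient allowed to vary across the 49 states.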

To test the predictive ability of the pooled, non-pooled and hybrid
estimates, the percentage change in receipts for each of the 49
states has been predicted for fiscal 1975. These predictions are
presented in Table 2. They are based on the coefficients estimated
over the period fiscal 1958 through fiscal 1974.11 For each set of
predictions, three statistics are given: mean error, mean square
error and mean absolute error. The results are clearcut. All three
approaches do about equally well on mean absolute error. The pooled
coefficients underpredict the percentage increase in state receipts
by somewhat less than the individual state coefficients for state
real personal income, but the pooled coefficients produce a slightly
larger mean square error. However, the hybrid method yields both the
smallest underprediction and the lowest mean square error. These
results suggest that the coefficients of state real personal income
produced by the hybrid method provide a better indication of the
responsiveness of state government receipts to fluctuations in
economic activity than either the pooled or individually estimated
coefficients.

III.  Conclusions 

In  this  paper  we  have  proposed  a  hybrid  estima¬ 
tion  technique  that  is  useful  when  analyzing 
pooled  cross-section  and  time-series  data.  The 
hybrid  estimates  are  a  weighted  combination  of  the 
pooled  and  individual  regression  coefficients. 

The  idea  of  this  approach  is  to  use  all  the  avail¬ 
able  information  for  the  estimation  of  each  re¬ 
gression  parameter,  while  at  the  same  time  allow¬ 
ing  for  potential  differences  that  may  exist  be¬ 
tween  the  different  cross-sections.  To  test  this 
method,  we  compared  its  predictive  ability  to  the 
predictive  ability  of  both  the  pooled  and  indivi¬ 
dual  regression  coefficients.  This  predictive 
test  was  performed  on  two  separate  models:  a 
model  of  inflation  in  fifteen  Latin  American 
countries,  and  a  model  of  dollar  receipts  for  49 
state  governments.  In  both  instances  the  hybrid 
estimates  yielded  better  predictions  than  either 
the  pooled  or  individual  regression  estimates. 

FOOTNOTES 

1.  The  appropriate  F-tests  for  pooling  are  well 
known;  see,  for  example,  Maddala  (1971). 

2.  An  approach  similar  to  the  one  used  in  this 
paper  is  discussed  in  Maddala  (1977;  pp.  332- 
333). 

3.  A negative Z implies that the average variance of the
$\hat{b}_i$'s is very large. Consequently, $b_i^*$ should simply be
the pooled estimate.

4.  Of course, if only a subset of the coefficients are assumed
heterogeneous, $\hat{\sigma}^2$ is as defined in (17). Also, if no
intercept term appears in the model, the degrees of freedom in (17)
are $T_i - K$.

5.  Consider, for example, a simple model where inflation is a
function of money supply in each of several countries. If in a
particular country there is little variation in the money supply,
little information can be obtained about the effect of money on
prices, and hence relatively little weight should be placed on
$\hat{b}_i$. On the other hand, as the variance or the number of
observations increases, more information is provided about money and
prices in that particular country, so more weight should be placed on
$\hat{b}_i$.

6.  With more than one explanatory variable each $\hat{b}_{ik}$ is
adjusted toward $b_{pk}$ independently of the remaining K-1
coefficients. That is, the covariance terms $S_{x_k x_j}$
($k \neq j$) are not taken into account in the $b_{ik}^*$
calculation.
7.  The data are taken from the International Monetary Fund,
International Financial Statistics (Washington, D.C.: May 1977). The
regressions begin with the observations for 1954, rather than 1952,
because of the presence of lagged variables in the equation. For
Uruguay and Honduras the initial observations are respectively 1956
and 1957 because some of the earlier data are not available for these
two countries.

8.  Whether Chile's astronomical inflation can be adequately
explained by monetary factors or by the economic chaos surrounding
the military coup and Allende's death is debatable. In any case, it
is not appropriate to evaluate the aggregate results solely on the
ability (or inability) to predict inflation in only one of the
fifteen countries.

9.  For  Uruguay  the  rate  of  inflation  can  be  pre¬ 
dicted  only  for  1973  because  money  supply  data 
are  not  available  for  1974  and  1975. 

10. It may be that Venezuela's role as a major oil exporter makes the
inflationary behavior of the other Latin American countries
irrelevant. Moreover, the only other country in the low and moderate
inflation group for which the coefficients from the individual
regressions do not give the worst prediction is Ecuador, the only
other Latin American member of O.P.E.C.

11. To make the predictions of all three methods directly comparable,
the coefficients from the pooled regression were always used for the
deflator, potential GNP and surplus variables. This was done to
remove all effects on the predictions that are not a direct result of
the three different approaches used to estimate $b_2$.
Table 1

MEAN SQUARE ERRORS FROM PREDICTIONS OF INFLATION FOR FIFTEEN
LATIN AMERICAN COUNTRIES, 1973-1975**

                       Individual     Pooled         Hybrid
                       Regressions    Regressions    Method
                       Mean Square    Mean Square    Mean Square
                       Error          Error          Error

Aggregate                .2890          .2701          .4473
Aggregate excluding
  Chile                  .0186          .0282          .0190

Uruguay*                 .1192          .1491          .1279
Bolivia                  .0278          .0403          .0280
Chile                   3.8945         3.4946         6.1571
Brazil                   .0182          .0060          .0179
Argentina                .0691          .1917          .1184
Paraguay                 .0051          .0033          .0039
Colombia                 .0076          .0045          .0056
Peru                     .0048          .0044          .0007
Mexico                   .0062          .0023          .0029
Ecuador                  .0023          .0033          .0020
Costa Rica               .0298          .0037          .0112
Honduras                 .0052          .0031          .0023
Venezuela                .0081          .0629          .0162
El Salvador              .0098          .0011          .0007
Guatemala                .0136          .0003          .0013

*For Uruguay the rate of inflation is predicted only for 1973 because
necessary data are not available for 1974 and 1975.

**The countries are arranged from highest to lowest rates of
inflation during the 1952-1972 period.

For lack of space the mean error and mean absolute error are not
reported here.


Table 2

COMPARISON OF PREDICTIONS OF RECEIPTS FOR 49 STATE GOVERNMENTS, 1975

                                Mean       Mean Square    Mean Absolute
                                Error      Error          Error

Pooled Regression               .0100      .00381         .0486
Individual Coefficients for
  State Real Personal Income    .0132      .00360         .0478
Hybrid Method                   .0094      .00359         .0481